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I. Requirements of the Clickstream System 

The Clickstream System is a software system residing on several hardware platforms within the BellSouth 
Interactive Video Services Network (IVSN). The system is being designed for the BellSouth Residential Broadband 
Trial which will take place in the city of ChamWee, Georgia from May, 1995 through May, 1997. The purpose of 
the system is three fold: 

1) To provide the capability for the collection of knowledge about the subscriber usage of the BellSouth System 
and Services. This is called Clickstream information. This knowledge will allow analysis of data to generate 
detailed viewing habits, programming ratings, and detailed demographics of the subscriber population. 

2) To provide a method (or methods ) to merge content identifier Metadata with the Clickstream information. 

3) To provide a storage and organization database system to manage and analyze the Clickstream and content data, 
and to provide an upload facility to 3rd party analysis entities, such as NCM l 3 /Optimark. This system has been 
commonly referred to as the MarKeting and Information System (MKIS). 

These 3 main functions of the Clickstream system are tailored to support the analysis of subscriber Interactive Video 
Services Network (IVSN) usage. Some analysis capability will be supported by the MKIS system, but main 
analysis function will be provided off-line by a third party (NCM/BULUArbitron) system referred to as I / 
Optimark. For a more detailed account of data analysis function, refer to NCM/BULUAibitron documentation. 
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2. Clickstream Hardware/Software Entities 

The software subsystems to support the Clickstream collection. Metadata merging, and data storage run on several 
hardware platforms as outlined in figure i. The subsystems communicate with each other in various ways to 
transmit Clickstream information, data error detection schemes, and data acknowledgments. Only the flow of 
Clickstream pertinent data and Metadata are displayed in figure 1. The software subsystems are divided functionally 
within each platform to aid in more parallel design, development and testing efforts. 




Figure 1: Clickstream System Date Flow 



Note: Arrows represent flow of Clickstream data only 
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2.L Software Subsystems 
The following is an overview of expected software subsystems, in-depth functionality will be covered in following 
sections. 



2. 1.1. STB Platform 
There are several parts to the STB based Oickstream Processor code: 

1) Oickstream Kernel 

2) Double Buffer of Oickstream Events 

3) Oickstream Upload Handler 

4) Oickstream Message Receiver, Upload Controller 

5) Oickstream Event API 

The STB based Oickstream Processor will reside in, and be executed from DRAM. It will be downloaded to the 
STB during the Level 2 boot process, and will be designated as a 4< Resident" application by the PowerTV Operating 
System (It will remain memory resident over soft powerdown STB states). 
The Oickstream Kernel will buffer Oickstream Events handed to it by applications. 

The Oickstream Message Receiver will accept control messages over the ESF Pass-through data link to control the 
uploading of information over the reverse path network and will store the payload of these messages in appropriate 
and available memory (NVM when possible). It will also accept the messages sent to it to acknowledge the receipt 
of a number of Oickstream Data Packets. 

The uploading of data will occur on a scheduled basis over every 24 hour time period. Oickstreams will be uploaded 
over the system messaging or "LI Pass Thru" protocol. This scheduling function will allow the distribution of 
uploads evenly over 24 hour periods to minimize network impact. Uploads may also be suspended every day during 
high network usage times (prime time, etc.). 
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Upload Controller, 
Message Receiver 



CHckstream 
Upload 
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CHckstream Data Upload : 
UDP/IP "Pass-Thru" Protocol 
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CHckstream Upload Control TXs: 
UDP/1P "Pass-Thru" Protocol 
downstream on Level 1 Network 



] 



Figure 2: STB CHckstream Journalllng Flow 



2.1.2. HP VTE Platform 

The video server simply acts as a pass-thru device for both STB sourced CHckstream Uploads, and for data ACKs 
returned to the STB. There is a communications router inside the VTE which will re-direct IP traffic to the 
a ppropriate destination. 



2, 1.3. Sybase IMS Platform 
The IMS platform wilt have two functions pertaining to the CHckstream system. One will be to act as a pass-thru 
device for STB sourced CHckstream information going to the Staging Server, and as a pass-thru device for Staging 
Server data acknowledgments to the STB. This communication will be handled through the Sybase Open 
CHent/OpenServcr Software and will take the form of Remote Procedure Calls (RPCs) executed on the IMS. The 
other function will be to replicate IMS log information to the Staging Server for inclusion in the Metadata which is 
further replicated to the MKIS. 
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2,1.4. Sybase Staging Server Platform 
The Sybase Staging Server will have four main functions. 

1) It will act as an endpoint in the Clickstream upload process. This is handled by a process and flat file 
persistant storage called "Event Capture". It will use the RPCs made from the IMS server to pass Clickstream 
information over, and formulate and send data acknowledgments to the STB. 

2) It will handle replicated IMS data logs destined for the MKIS system. 

3) It will have a decompression, parsing, and data merging mechanism which will separate the Clickstream 
data into appropriate relational database entities as oudined in the MKIS specification. 

4) It will act as a persistent storage device from which MKIS database information can be replicated 



2.1-5. MKIS Platform 

The MKIS system will serve several functions within the Clickstream system. The particular design of this database 
and analysis engine is covered extensively in the document "IT Architecture Design Ver. 1.1". A top level 
look at this system reveals 4 main functions. 

1) Mass data repository and relational database structure for t 
Clickstream, subscriber, demographics, billing . advertisement and content information 

2) To enable regular data uploads to third party analysis engines, and to be able to restrict access to sensitive 

data fields. ; Sam > „_ 

3) To perform "native ' data analysis functions and report generations to diflerent specifications. 

Appiicatio r- 
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3. STB Clickstream Processor 
3.1. Overview 

The Clickstream processor is structured to store Clickstream event records in one of two Clickstream double buffers. 
The Clickstream buffer sizes are staticly provisioned by the PowerTV OS as defined in the code download process. 
The buffer size will be optimized during the test and integration phase of the project The possibility has not been 
ruled out to addressaMy download the size of the Clickstream STB double buffer. The size of any particular 
Clickstream event record is variable to handle the different joumaling needs of different applications. An event record 
is the individual logging of a subscriber input to the STB. The generation of these events fill one of the two 
Clickstream buffers. When a buffer is filled, or an upload timer event has expired, the Clickstream processor ts 
flagged to initiate an upload process with the Staging Server based Event Capture application, and the buffer 
becomes locked. Subsequent Clickstream records are stored to the second Clickstream buffer. 

The upload process consists of several steps. The buffer to be uploaded through the network is locked No more 
Event Records can be stored to this buffer. The buffer is then compressed. The PowerTV OS will support the LZW 
compression utilities. These will be employed to compress the buffer. Fifty percent compression is acheivaWewitn 
Oris algorithm on random binary data. Clickstream data roughly fits into this category. The resultant compressed 
data is then divided into transmission "transactions" or "packets" and data is placed into the packet headers i to indicate 
packet ID. IP destination address, etc CRC error detection will either be placed on each packet by the Oickstremn 
Processor (application level), or by the UDP drivers in the OS. This issue is currently TBD. Each data packet]** 
UDP/IP protocol around a LI pass-thru header. The final bit-format level specifications have not yet been received 
from PowerTV, but will be included in this specification on their receipt 

The Clickstream data packets are uploaded over the Slotted-ALOHA data tnmstmtter out of the STB. St ^ 

Server based "Event Capture" application will receive these records and transmit addressable responses to theSTB in 

the form of data acknowledgments. The frequency and periodicity of data acknowledgements wdl 

during the implementation stage of the project Correct implementation of application level data acknowlegements 

require empirical knowlege of network bit error rates, netowork packet error rates, and causes of these errors in 

transmission. 

If the second buffer becomes completely full and the upload process has not completed, then a bufferoverrun 
condition will occur. The last record m each Clickstream buffer is reserved for a buffer ovemm rec^d w, tn a 
timestamp. this essentially tells the Staging Server resident application that STB dicks were missed starting at a 
particular time. This error condition can then be corrected by increasing the buffer sizes. 

(NOTE: Receipt of a buffer overrun record will tell us that buffer sizes have not been set appropriately, aodwewill 
have to re-set buffer sizes, re-compile our code and release it to the system as an update. We may ^°^emeot an 
addressable parameter setting the Clickstream buffer sizes dynamidy. (There are reasons both pro and con to do 
this). Appropriate buffer sizes should be determinable within the first month of integration testing.) 



3.2. Clickstream Kernel 

The Clickstream code will be downloaded to the STB during the Levd 2 (L2) boot sequence as outlined in the 
BdlSouth/Sybase document 'TTV Process Hows Rev. 1.00". It will be bundled along with all Binary r Large 
OBjects (BLOBs) intended to be memory resident . and downloaded at initial AC power on f ollowmg me Ixvet 1 
(LI) bootterm . Bundling of services takes place within the IMS server and takes the form of a ^ ° f 
which is the first item downloaded to the STB on L2 Boot There ,s a memory auocaaon and a P^^^f 
assodated with each of these applications downloaded into DRAM. They are contained in what P°w«TVtermsa 
-Dispatch Table" induded with each downloaded application. For more information on lists of servtces and the code 
download process, see Sybase IMS documentation, and the PowerTV document "Loader Specdlcauon . and 
"Application Download Procedure". 
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3.3. Clickstream Buffering 

The database is allocated into a double buffer (two blocks of memory). Each block is a contiguous free area of 
DRAM which is set aside for this application use only. The buffer sizes should not need to be very large (10K- 
15K?), so allocation of contiguous Mocks should not be a problem. Utilization of more advanced database 
techniques such as linked lists or record pointers should not be necessary in this type of application and would only 
increase the application footprint and complexity. The database buffers should be allocated to allow at least 4 to 8 
hours of peak channel "surfing" between uploads. This will lead to provisioning the actual sire of the data buffers 
after we get some empirical results on subscriber usage. This should keep system bandwidth needs within reason. 
Data compression techniques will be employed to further reduce the transmission bandwidth needs. 

The Clickstream event records consist of data specified by Arbitron/NCM in the document "Database Design 
Specification NCM 1 3 *\ plus additional fields used on an application specific basis to allow for flexible 
documentation of subscriber usage of the interactive system. This data is collected from two sources by the STB 
when an event is registered The application generating the Clickstream event hands off data regarding the 
application state and the subscriber UI which caused the event. The application will also hand over information 
obtained from the OS such as times tamp and STB id. 

3.4. Clickstream Event Format 



3.4*1. General Header Format for Events 
The general event format is tailored to be as flexible as possible. Each application running on the STB will interface 
to the Clickstream routines to send just the data type and format that it needs to journal. 

The general information sent by each application will uniquely identify the time of event generation to the second, 
the application which generated the event and the number of bytes to follow which constitute the information the 
application wishes to journal. 

(Developer Note: The Application ID sent in the API call to the Clickstream Processor and stored in the Buffer 
as shown below is NOT the Application ID given to your application by the Operating System during 
initialization, but the Application ID listed in section 3.4.3 of this document This identifier is unique and static 
and will enable backend systems to parse the events journaled by your application. The application ID given to you 
by the operating system at application initialization is dynamic and will change every time your application is 
launched. If there is no Application ID listed for your application in Table 4 of section 3.43, please contact current 
contact person listed on the cover of this document to have one assigned to you.) 
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Syntax 


Size/Format 


Clickstream Buffer 0 




{ 




Buffer Header Record { 






ui8 


dickstream Version Number 


ui8 


limes tamp 


td8*6 


Number of Bytes 


ui8 


STB MAC Address Most Significant 


uil6 


STB MAC Address Least Significant 


ui32 


Compression Type 


ui8 


> 




Event Record 




for fi=0; i<N: i++) { 




Times tamp 


ui8*6 


BIMs Assisted Application ID 


uil6 


Number Bvtes to Follow (length) 


ui8 


♦♦Application Specific Data** see section 3.4.4 for specific 
formats used by each application. 


ui8 * length 


> 




CHckstream Trailer Record ( 




Times tamp 


iri8*6 


BIMs Assigned Application ID 


iril6 


Number Bytes to Follow (length) 


ui8 


Upload Status Code . 


ud8 






h 





N = Number of Event Records able to be stored in this Clicks tream_Buffer. 

Figure 3: CHckstream Buffer Format 

There air two buffers such that one may be "frozen" during the upload process, and the second buffer can then be 
employed to continue journal mg event records. Both buffers are of identical format This format is shown in figure 
2 above. The notation used in Figure 2 is ISO standard for description of data structures. 

The dickstream Buffers are formatted specifically to ease in their transmission back through the distribution 

network to the Staging Server "Event Capture' data repository. The first section of entries comprise a h ^.™°™ 
which will indicate the time the buffer was first opened, the number of bytes in the buffer, will define the onginatmg 
STB by MAC Address, defines the version of the dickstream Kernel which generated the buffer, and specifies me 
type of data compression used on the following data (if any). This first record is of fixed length and is never 
compressed. All bytes following "Compression Type" are possibly compressed to save in transmission bandwKltn. 

The Oickstream.Trailer record appears at the end of a Oickstream Buffer and usually denotes an error roaHtion. If 
the upload capabilities of the system did not allow the clearing of data from the STB CUckstream Buffers^ This 
record distinguishes itself by the Application ID pointing to a "Missed Records" type so that queries can be 
performed on Uris data in one of our backend systems. The Upload Code is then used to identify the possible cause 
for the error condition. These error codes are outlined in section 3.4.3. 
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3.4, 2, Upload Codes 

Upload Codes in the general event format are meant as an indicator of what state the Clicks tream application was in 
when a buffer overflow occurred. This code indicates what stage in the upload process the dickstream kcrad was in 
at the time of overflow . This will help us determine the cause of buffer overflows, and the appropriate way to solve 
these problems during testing and integration. 



Upload Codes 


0x00 


Not Used 


0x01 


Upload in Progress 


0x02 


Upload Completed, No Acknowledgement Received 


0x03 


Upload Completed, Partial Acknowledgements Received 


OxM 





Figure 4: Upload State Codes 



3.4.3. Application Unique Identifiers 

Each application with the ability to operate on a BellSouth Interactive STB will be assigned a 16 bit unique 
identifier. In addition, each application will have the option of posting an 8 bit Application State identifier within 
it's application specific data. This document, and further revisions, will act as the authority for the assignment of 



Application Identifiers 


0x0000 


Operating System 


OxOOOl-F 


Operating System Sub-Systems (TBD) 


0x0010 


Application Manager 


0x0011 


Cable Television Application 


0x0012 


dickstream Set-Top Box Kernel 






0x0100 


Prevue Interactive Services - EPG System 


0x0101 


Digital Pictures - Interactive Game 


0x01 10-F 


Viacom - MTV / Showtime, etc. 






0x1000 


Interplay Written Applications General ID (TBD) 


0x1001 


Interplay Runtime Eneine 


0x1002 


Interplay Navigator 


0x1003 


Interplay VOD 


0x1004 


Interplay NVOD 


0x1005 


Interplay TownGuide 






0x1100 


The Weather Channel, Weather On-Demand 


0x1101 


Woridspan - Travel On-Demand 


0x1102 


Lightspan - Educational Interactive Application 


0x1103 








OxFFFF 





Figure 5: Application Unique IDs 
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3.4.4. Application Specific Event Formats 

Developer Note: The Clicks tream formats proposed in this section are close to what we currently believe will 
be the final data formats. As we get a better understanding of all of the analysis needs, these data formats may 
change. 



3.4.4.1. Operating System 

The operating system will have the ability to make the same call to the Clicks tream system as any other code 
residing on the STB. It may use this facility to log errors, for testing purposes, to track code usage, or for any other 
reasons. The OS tie-in to the Oickstream system will be predicated by PowerTVs interest in being involved, and is 
not seen as likely in the short term. 



3. 4. 4. 2. Application Manager 
Application Manager is a portion of the "Level 1 Client" as defined in PowerTV operating system documentation. 
This code will be responsible for managing the execution of all applications on the STB. handling application 
switching, removing applications which are mis-behaving, and the like. BellSouth will be interested ut logging of 
all application switching activities. A data format has been established to this end. Implementation of Application 
Manager po-ong Clickstream Events will depend on schedules and interestjevel of PowerTV. 



Syntax 




hop Manager: Application Specific Data 0 








Event Record . 




for (i=0; i<N; i++) { . ■ 




Event ID: See Global Event ID table for Syntax 


iri 16 


Suspend Application ID . — 


iril6 


Initialize Application ID 


in 16 


Application Error Code: See below for Syntax 


ui8 


1 




li 1 








Error Code Syntax 


Data 


Application Manager: Application Error Code () 


u!8 


J 




Do not use (No Error) 


0x00 


Application termination reason unknown m 


0x01 


Missed Watchdog Tuner (Application Keep- Alive) . 


0x02 


Unexpected Session Teardo wn 


0x03 


Memory Error m . 


0x04 


Application will not download _ 


0x05 


Communication Error 


0x06 
0x07 


Network Error 

Ll . - 





Figure 6: Application Manager Event Data Format 
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3. 4. 4. 3. Cable TV Application 

The Cable Application is the second half of what PowerTV calls the "Level 1 Client". It is responsible for tuning 
both analog and digital broadcast services. 



Syntax 




Cable Application : Application Specific. Data 0 




i 




Event Record 




for (i=0; i<N: ( 




Event ID: Sec Global Event ID tabic for Syntax 


uil6 


Channel ID : See Broadcast Channel ID taWe for Svntax 


uil6 


> 




J_ - 





Figure 7: Cable Application Event DaU Format 



The Channel ID 16 bit field is a unique identifier for broadcast networks. In analysis of data, the fact that channel 6 
was watched more than channel 7 has little or no meaning to BellSouth. When the network associate d with t he 
channel is used, the data becomes both clearer and more valuable. To illustrate, the fact that 'TBS" was watched 
more frequently than "MTV" would be valuable information, and is more in line with our end requirements. Thu 
encoding method also allows the data to stay consistent across channel lineup changes that will take place during the 
trial or as the system grows past technical trial this encoding scheme will allow Oickstream data from various 
headends in locations all over the southeast to generate consistent data no matter what their individual channel ^ 
lineups may be. A list of all major broadcast networks in the US appears in the BellSouth John Stefamk authored 
document "Residential Broadband Applications Guide VI. 10". The encoding scheme for these broadcast providers is 
defined in this document in the Metadata formats section. The Event ID as Usted above defines the action which 
occurred on that channel at that time and is also defined in the Metadata section of this document 



3.4.4.4.Clickstream STB Kernel 



The Oickstream Kernel may generate Oickstream events itself to log errors in Oickstream collection and/or 
transmission. We will have a better handle on the types of errors or faults we would like to capture during our test 
and integration phase. This information will be folded into this design document on an as-needed basis. 
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3.4.4.S.Na vigator 

The Navigator is the interactive memring system which enables viewers to select from our many interactive services. 
This is a BellSouth initiative and we will be collecting usage data to analyze how easily viewers are able to access 
progr ammin g they are after. 



Syntax 


Size/Format 


Navigator : Application Specific _Data Q 




i 




Event Record m ; ; 




for (i=0; i<N; ( 




Event ID: See Global Event ID table for Syntax 


uil6 


Application State ID: See table below for Syntax 


ui8 


for(i=0;i<Nl:i++> f 




Character String: ASCII, Variable Length 


b8*Nl 


> 




> 




> 1 








State ID 




Navigator: Application Statc_IP () . 


ui8 


J — 




Fly-Thru m . 


0x00 


Main Menu . m . — 


0x01 


Information (Help) Screen or Video _ . 


0x02 


Movies Sub-Menu . . 


0x03 


Movie Categories Sub-Menu , 


0x04 


List of Movies Sub-Menu . - 


0x05 


Movie Info Screen . . 


0x06 


Movie Buy State m _ . 


0x07 


u . 





Figure 8: Navigator Buffer Format 



3.4.4.6. VOD 



Video On Demand will be launched from the Navigator and will be a BellSouth service, aickstreani , capturing will 
be interested in the amount of pausing. Fast-forwarding and Rewinding people do, as well as what is b«ngwcwedat 
a particular point in time. Some of tins will change as we get a better idea of exactly what events are logged^ 
IMS server. We will try not to duplicate effort wherever possible. If it is determined that the IMS Ev^t Log is 
capturing this level of information, then the dickstream Joumalling of this information is redundant and will not be 
implemented. 
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Syntax 


Size/Format 


Video On Demand (VOD) : Appllcation_Specific_Data () 




f 




Event Record 




for (i=0; i<N; f 




Event ID: See Global Event ID table for Syntax 


uil6 


Amplication State ID: See table below for Syntax 


ui8 


> 




> 








State ID 


Data 


Vnn* Annlimtlon State ID O 


ot8 


-L — — 




Playing 


0x00 


Paused 


0x01 


Fast Forward 


0x02 


Rewind _ 


0x03 


Info (Help) Video or Screen Plaved 


0x04 


reserved 


0x05 


reserved 


0x06 




0x07 


Li 





Figure 9s VOD Buffer Format 



3.4.4.7. NVOD 



This application is also a BellSouth service. The following tables outline expected application specific event 
fonnats, and application states possible within the application. The NVOD application will be active without an 
active Client/Server connection to the IMS. Oickstrcam System will be the oly mechanism for BellSouth to glean 
usage information on this application. 



Syntax 




Near Video On Demand (NVOD) : App_Speeiflc Data () 




i 




Event Record 




for (i=0; i<N: i++) ( . 




Event ID: Sec Global Event ID table for Syntax 


id 16 


Application State ID: See tabic below for Syntax 


ui8 


> 




J ■ 
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State ID 


Data 


NVOD: Application State ID 0 


ui8 


{ _ 




Playing , 


0x00 


Incremental Pause 


OxOI 


Incremental Skip Forward ( . 


0x02 


Incremental Skip Backward (Rewind) • 


0x03 


Info (Help) Video or Screen Played — 


0x04 




0x05 


reserved ■ 


0x06 




0x07 


> 





Figure 10: NVOD Buffer Format 



3.4.4. 8. EPG 

Prevue Guide Interactive Services are expected 83 the Electronic Program Guide (EPG) provider for the BellSouth" 

Interactive Trial. t 



Syntax 




Electronic Program Guide (EPG) : App Specific Data () 

_L_ ■ 




Event Record 

for (i=0: i<N; i++) { . 

Event ID: See Global Event ID table for Syntax 


ml6 


Application State ID: See table below for Syntax . 

reserved , m 


ui8 
iri8 







LL 1 




State ID 

EPG; Application State ID 0 _ 


ui8 


J " 

Initial Display Screen 

Look Ahead Display 4 hour 


0x00 
0x01 


Look Ahead Display 8 hour . . 


0x02 


Look Ahead Display 12 hour 

Look Ahead Display 16hour 

Look Ahead Display 20 hour 

Look Ahead Display 24 hour 

reserved ■ 

J 


0x03 
0x04 
0x05 
0x06 
0x07 



Figure 11: EPG Buffer Format 
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3.4.4.9. Home Shopping 
Expected data formats will be developed following the successful deployment of initial offerings of interactive 
applications in 1995. 

3.4.4. lO.Other 

Expected data formats for a large number of interactive applications will be developed following (he successful 
deployment of initial offerings of interactive applications in 1995. 
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J.5. Applications Programming Interface (API) to Clickstream 



To interface with various third party applications, the Clickstream Processor has utilized the cooperative nature of 
the PowerTV operating system which allows for internal point to point messaging. The PowerTV operating system 
uses the concept of EVENT very broadly to handle everything from thread usage, semaphores, system thread 
contention, timer handlers, exception handling, etc. Using the EVENT system, BellSouth has defined a unique 
Operating System Event Type: 

( kEt_clickStream ), which will be used for inter-process communication with the Clickstream Kernel. This allows 
us to get away from the alternate design of having to support Dynamic Link Libraries (DLLs) for all interested 
applications, and the support and complication it would have introduced into the system. 



Developer Note: Make sure that the difference between the "Operating System concept of Event" and the 
generation of a "Clickstream event" remain somewhat clear to you. they can be easy to confuse one for the other. 

The Clickstream event, or Clickstream Journalled event is a data set that identifies subscriber action on the ITV 
system at a particular point in time. e.g. "Channel was incremented once from channel 5 at 5: 1 2PM on Tuesday" 
The PowerTV Operating System Event system and PowerTV Event data structure are generic ways the OS gets its- 
job done. The system and the data structure are employed to handle a myriad of OS responsibilities. We are simply 
using this Event system as a way to get our Clickstreams from point A to point B within the Set-Top Box. 



3.5. 1. PowerTV OS Events: 

Running under the PowerTV OS, an application can generate Events, receive Events, filter reception of Events, etc. 
The Event itself consists of 5 parameters: 



PowerTV General Event Data Structure 


Size/Format 


Event () 




( 




Code 


Defines the Event Type, Event source, 
what type of device generated the Event, 
and a general 8bit field for Event data. 


iri32 


X 


Application specific data field. 


ui32 


Y 


Application specific data field. 


ui32 


Z 


Application specific data field. 


ui32 


T 


System time when Event was generated, (this is 
from a 64 bit PowerPC register and has no link to 
real world time, it is used for processing the Event) 


iri64 


U . 





Figure 12 : OS Event Data Structure 
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An application will start receiving Events when it uses PowerTV APIs to "register interest" in a particular, or a set 
of particular Events. An application can start generating Events at any time by simply using the appropriate APIs. 
The Code field of the Event is what PTV uses your mask to check against any Events that are generated. A quick 
look at the way the Code field has been structured (above) should demonstrate that there are several ways die Code 
can be used to filter and route Events. When the App registers Event interest it defines an Event mask in terms of 
the Code field. The OS then uses this mask along with all others it has received to decide where an Event should be 
routed, and in which Event Queues it should be placed. When Apps register interest they also specify a DRAM 
memory location called an Event Queue which acts analagous to a data mailbox. If your application asks for a lot of 
Event types, (e.g. "Give me all Events generated by all external devices") you will receive alot of "mail" in your 
Event Queue memory locations, if you ask for only certain Events (e.g. "Give me an Event only when the Power 
button is pressed") , you will receive just sporadic Events in your "mailbox". 



3.5.2. Cllckstream Event type: M kEt_cllckStream M : 



BellSouth has defined a unique Operating System Event Type for use in the BellSouth system, and has done so with 
PowerTV approval. Today this Event Type is defined as a "#define ' in our application source, but will appear in the 
PTVTYPES.h file which PowerTV releases as part of their standard Operating System release in the near future. 
(This #define will only appear in releases of PowerTV for BellSouth use and 3rd party application developers for 
BellSouth.) 

Clickstream Kernel is initialized following Level 2 boot by the PTV OS Application Manager. During it's 
initialization process it registers interest with the PowerTV OS in any OS Event of the "kEt_clickStream type. It 
is the only application which registers interest in this Event type. When an application with focus (or otherwise) 
reaches a point in code where an action worthy ofjoumaUing has occured, it places the contents of the Clickstream 
Joumalled event into a known area of DRAM. It then launches a kEt_clickStream type Event. With a 
kEt_clickStream Event the following syntax is used 



kEt ciickStream Structure 


Contents 


Size/Format 


kEt cllckstream (> 






f 






Code : 


0x00008000 


ui32 


X Clickstream Application ID 


See section 3.4.3 Fig. 5 for ID used 
by each Application. Fill with leading 
Zeros to be ui32. 


ui32 


Y Length of Data 


Number of dick Bytes at DRAM 
location Z. Also leading Zero stuffed 


ui32 


Z : Memory pointer to data 


♦DRAM pointer 


"uL32 


T [Operating system will append the 
system time here, Apps don't have to. 


Application does not use this field. 


ui64 


) 







Figure 13 : Cllckstream Event Type Data Structure 



The Clickstream Kernel will receive the OS Event sent by the application. It will use the mf ormanon rxovided in 
the X Y. and Z parameters to grab the Clickstream data out of it s temporary location in DRAM and into tfie 
Clickstream double buffer area. The Clickstream Kernel will concatenate this dickstreara data at the end ot me 
buffer it is currently maintaining. It performs a level of memory management on the double buffers to maintain 
current pointers of free memory. 
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3.5.3. Clicks tream API Source Code Example 
The following is a source code example of a miscellaneous application making a Clickstream jouraalling call: 



II *******General Header c 0 d e ****************************** 
/ / Apps would place this first part of code once in a header area, 
/ /or make sure it was in a # include 

It define CLICK / / make it easy to pull the code out at compile time 

«ifdef CLICK 

#define kEt clickStream 0x00008000 / /all apps who wish to journal use this #define, it will 

/ / live in the PTVTYPES.h file by end of year. 

#def ine kEd.dickEntry 0 / / event data for normal dick stream event 

#define CUCK BUFFER.SIZE 0x20 / / local storage of one or two dicks, this size wtU be 

/ / slightly application dependent, but 16 to 32 bytes 
/ / should be suf fident for most applications. 

tfdefine APPJD 0x0000 // Application Identifier should be defined here. 

//IDs for all applications are listed in Figure 3 



void *dp tr; / / declare a pointer to local Click data storage 

# end if 



// ** End of General Header Code* 



/ / ******* Code for each key click occurances ****** 
n ifdef CUCK / / make rt e3S Y to P uH lhe code out at com P ile Mine 

dptr = pk Galloc* CLICK BUFFER.SIZE ); / / Application needs to grab some amount of 

r " //DRAM for temporary storage of Clicks locally. 

sprintf((ui8 *)dptr,"clickstream stuff); //place dick data into memory at the *dptr location. 
/ /This is an example of pladng a string of ASCII text at this location. App developers will be 
responsible for writing code to place a data strud as outlined in Section 3 into memofy at dptr. 

pk_PostEvent(kELclickStream I kEdjzlickEntry, APPJD f (ui32)strlen(dptr),(ui32)dptr); 
1 

ffendif 



/ / ******* End of code for each key click occurances 
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Developer Note: 

The code associated with each Key Click will be responsible for Mallocing memory for each Click, and the 
Qickstream Kernel will then De-allocate the 

memory assigned with every Click. Because of this it is imperative that the sending application check to make sure 

that the chekstream application is up and running in background mode. The sending apphcabon can do this by 

frying the Application Manager for an application by the name of 'X3JCKSTREAM". Tins caHwiu also remm 
Haling application the system event que of the Qickstream Processor. If the Clickstream Processor ts NOT 
running in the background to De-allocate this memory, there will be memory leaks associated with each attempted 
joumalling call by the sending application. 
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3.5.4. Clickstream Upload 

Once a Oickstrcam buffer has been filled, or the aickstream Kernel has decided an upload of data is appropriate for 
other reasons (timed upload, low system utilization, commanded upload etc.) the buffer will be formatted, 
compressed, and then uploaded through the system to the Sybase Staging Server. 

The aickstream upload takes place over the system messaging or Level 1 pass-through protocol. The data protocol 
is uni-directional UDP/IP and so there will be application level acknowledgements from the Staging Server during 
the upload process. The STB application will use the PowerTV system messaging APIs to send data packets 
upstream. 
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Figure 14: Upload Data Flow 
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3.5.4. I. System Upload Control I Load Distribution 

There will be several ways for the upload of Clickstreara information to be controlled. A broadcast transaction set 
will be used to set up the upload control method used, or uploads can be commanded by the system. In 
implementation, this means MKIS or Staging Server will source a set of transactions delivered over the Level 1 
pass-through messaging to perform this upload control. The data associated with each STB will be stored in Non- 
Volatile Memory. Because of the small footprint of the data, and the adverse effects that would be caused if the 
upload control data parameters were not present within the STB , this data will reside in NVM 



3.5.4. 1,1. STB Group Assignment for CJickstream Uploads 
In order to assign STBs to a uniformly distributed set of groups during the trial period, the 48 bit STB MAC address 
can be used to randomly distribute boxes within groups in the system. The dickstream Kernel will use the least 
significant 5 bits of the MAC address to determine the dickstream group in which it belongs. This will define 32 
groups within the system. This way a separate addressable transaction data packet for group assignment does not 
have to be defined, transmitted, and received during the trial period. This is a short term solution, and would have to 
be modified to an addressable group assignment methodology concurrent with large scale deployment 
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Figure 15: Cllckstream Group Assignment 



3.5.4.1 ,2.Upload Control Broadcast Message 



This one message depending on how it is configured with data will allow for the following 

1) Cyclic Upload Control to Groups 

2) Cyclic Upload Control to all STBs 

3) Upload on Command/Polling Control (addressable only) 

4) Suppression of Upload on buffer full 

5) Collection Control On/Off addressable 



TCP/IP Protocol Header 


Adaptation Type 


Protocol 
Discriminator 


40 Octet 

Destination: CMC 

Sender VTE 


I Octet 
0x02 = TCP 


I Octet 

(Fixed for BLS Trial) 
0x80 






Adaptation Data 


Messagc_ld 


Request Id 




2 Octet 

Length (Octets to follow. 
Not including Trailer) 


1 Octet 
(Fixed) 
0x60 


2 Octet 
Not Used 


1 Octet 

0x01 = Addressable 
0xO4 = Broadcast 


VTE inserted > 




Level 2- Defined > 
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Destinatton.Address 


L2_DataCount 


DataJTvpe 


VSP Id 


6 Octet 

• MAC Address or 

• OxH«tt-r = Broadcast 


2 Octet 

# Octets to Payload End 
Variable 


1 Octet 

0x00 = User Data 


1 Octet 
0x01=BIMS 


Level 2-Deflned > 



CUckstream Upload Control (Transaction_Code and Transaction.Payload) 


Octetf 


Contents 


T 0 


Transaction Code MSB = 0x80 


T 1 


Transaction Code LSB =0x10 


o 


Clicks tream Processor Version Number 


1 


Global 

Flag (bl) 


Addressable Flag 1 Group Address - Denotes the group of STBs to fidd this 
(bl) 1 transaction (b5) 


2 


Collection On/Off Key - Will turn CUckstream collection On/Off to a STB or 

Group of STBs (non-Global only) 


3 


Perform Upload Now Key - Will perform an upload on command 

Will only upload on command if Global Hag is NOT set 


4 


Suppress Upload on Buffer Full - Will keep the STB or Group from uploading when buffer is full. 

The STB or Group will only upload on it's appointed upload cycle. 


5 


Upload_Cyde_Definition - A STB will have 1 to 4 possible upload cycles defined. 

Tbis will define any one of those cycles. 


6 


Cycle First Occurrence Start Time (Total b48) - Year (b8) 
Defines the time of the first upload in cycle. 


7 


Cycle First Occurrence Start Time - Month (b8) 


8 


Cycle First Occurrence Start Time - Day (b8) 


9 


Cycle First Occurrence Start Time - Hour (b8) 


A 


Cvcle First Occurrence Start Time - Minute (b8) 


B 


Cycle First Occurrence Start Time - Second 


C 


Upload Duration (Total b24) - Hours(0-24) (b8) 
Defines a duration of time over which the STB randomizes upload start time. 


D 


Upload Duration 


Minutes(0-59) (b8) 


E 


Unload Duration 


Seconds(0-59) (b8) 


F 


Cycle Time (Total b32) - Days (0-14) (b8) 

Defines the oeriodicitv between uploads. This is the mean time between uploads. 


10 


Cycle Time 


Hours(0-24) (b8) 


11 




Nfinutes<0-59) (b6) 


12 


Cycle Time 


Sccoods(0-59) (b8) 


0x13 
to 

0x27 


reserved 









TCP/IP Protocol Trailer 



CRC Checksum, etc 



VTE inserted > 

Figure 16: Upload Control Transaction Definition 
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Data Dictionary: 

The Clicks tream upload cycle consists of several parameters which define a start time and a cycle on which the 
uploading of data happen. The Tirst occurrence ' defines a starting time in history from which the cycle runs. The 
"cycle time" defines the amount of time which elapses between periods of the upload cycle. When a cycle is 
complete the Upload duration ' time starts, and the STB Oickstream Processor will randomize an exact upload time 
within the Upload Duration, This will distribute the network load relatively evenly over the entire Upload Duration 
period. 



Upload Duration randomizes time to perform Clicks tream 
upload over this period of time 




Uploadof 
data occurs 




Cycle Time 



z 



2SZ 



Cyde First Occurence Start Timr 



Time 



Figure 17: Upload Cycle Diagram 



An example of the use of these parameters would be to define a period of time every day for STBs to upload data. 
BellSouth has a requirement to perform analysis of uploaded data on a daily basis. This should be available every 
morning towards the beginning of the work day. Peak usage of broadcast Prime Time and of Interactive services will 
typicly be from 7pm until Midnight. This would be a time period when uploads should be inhibited to limit the 
burden on the Level 1 distribution network, which will be busy with exclusive session setups and tear-downs. 
Beginning at Midnight. Clicks tream uploads would begin. In order to have all STBs upload before 8am, and given 
the number of upload groups within the system to be 32, each group would perform uploads over a 15 minute 
period. To achieve this upload cyde: 

Cyde_first_Occurance_Stert_rime = 12:00am Jan 1. 1995 + "X" * 15 minutes. 

Each upload group would have this parameter staggered by 15 minutes. 

Cyde Time =24 hours 

Upload Duration = 15 minutes 



A total of 4 upload cydes will be able to be defined for each group of STBs. This would allow for weekly uploads, 
or any other combination of cydes to work around peak LI load times. 
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3. 5. 4. 2. Communications Procedures for Upload 

The STB will initiate the upload process. When the STB has determined it is appropriate to perform an upload of 
Clicsktream data (for any of the reasons covered in section 3.5.4), it will lock the DRAM buffer actively being 
filled, it will compress the buffer using the LZW compression utility, and it will perform the upload of the 
Clickstream buffer over the Li Pass-Through network. 

The STB initiates reverse path LI Pass-Through by calling the "sess.MessageRequesr PowerTV API. Within the 
parameters of this API are the addressing Mode, the destination address, the size of the "message" packet to send, and 
a pointer to the data in memory to send upstream. 

re sess.MessageRequest ( ui8 mode, ui8 address(6], uii6 sp_count, ui8 *sp_data); 

From this point, the PowerTV operating system and lower level drivers will handle the uploading of the message, 
including the insertion of the UDP/IP protocol layer and CRC-32 transport trailer. 

There WILL be a limit to the size of the data allowed in a single reverse path LI Pass-Through transaction. Today, 
this is not explicit in PowerTV documentation, but in disclosure documentation it is stated as 252 Octets. 
Because most Clickstream uploads will require more than this, the Clickstream Processor will be making sequential 
calls to this MessageRequest API to complete a single upload process. 

There are several different addressing modes available in LI Pass Through. The one that aickstream Processor will 
be using will identify a VSP as the destination of these data packets. That address_mode is 0x03. The address 
following this address_mode is a 2 Octet SP JD (short for VSP identifier). It should be right justified and 0 filled 
into the 6 byte field available. The sp_count and *sp_data are simply the size and pointer to the data to be uploaded. 
All of the data to be uploaded appears as "Payload" to the STB, the signalling network, the CMC, and the Event 
Capture mechanism on the Staging Server back end Level 2 system. 
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IP Header 


UDP Header 


Adaptation Type 


Protocol 
Discriminator 


20 Octet 

Destination: CMC 
As defined in RFC 791 


8 Octet 

As defined in RFC 768 


1 Octet 
0x01 = UDP 


1 Octet 

(Fixed for BLS Trial) 
0x80 








Adaptation Data 


Message a Id 


Requested 


Address^Modc 


2 Octet 
0x0000 or 
srtTuential counter 


1 Octet 
(Fixed) 
0x60 


2 Octet 
NotUsed 


1 Octet 

Address of VSP 
0x03 = "SP ID" 




Application-Defincd> 






Destlnatlon_Address 


L2 DataCount 


IP Address of AS 


Service ID Lenath 


6 Octet 

• Address of VSP 


2 Octet 

# Octets to Payload End 
Variable 


4 Octet 
(Fixed) 

• IP Address of App Server 


4 Octet 
TBD 










Service ID Value 


STB MAC Address 


Data Type 


VSP Id 


4 Octet 
TRD 


6 Octet 

♦ Hardware Address of STB 


1 Octet 

0x00 = User Data 


1 Octet 
0x01 = BIMS 





riirk«freiim Unload Data Packet (Transaction Code and Transaction_Payload) 


Octetf 


Contents 


T 0 


Transaction Code MSB = 0x80 


T 1 


Transaction Code LSB =0x18 . . 


0 


Clicks tream Processor Version Number , 


1 


Upload Sequence Number ; . 


0x02 


Cllckstream Upload Buffer Data Structure (as outlined m Figure 3 ) 


thru 
OxFA 


The data stmcture is broken up into as many reverse path transactions as are ncccessary to perform 
the complete upload of data. 





Control 


Lenath 


CRC-32 


2 Octet 


2 Octet : # Octets from 


4 Octet 


(Fixed) 
OxFFFF 


IP Header to Payload End 
Variable Length 


CRC-32 Calculation 


PowcrTV OS Inserted 





Figure 18 :UpIoad Transaction Definition 
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j Upload 



NCM/I' 
Analysis Engine 

"* 



dk bream 
Uploads 



Sybase 
IMS 



BellSouth 
MKIS 



HP 
VTE 



I 



Sybase 
Staging 
Saver 



LI 
Network 



QAM 
Modulator 



QPSK 
Modulator 



Gonnection 
Management 
Controller 
(CMQ 



QPSK 
-EtemodL 



CfickstieamDa* 



Acknowledgment 



rrrn 



. Okbtream packets (St-Ak>ka)~ 
H UUFAHFioIocoI 



STB 
TTT1 I 



Figure 19: Upload Data Flow 2 



3.5.4.3. System Acknowledgements on CUckstream Buffer Uploads 
The Event Capture Procedure running on Staging Server will be responsible for formulating and sending some leve 
of data acknowledgements to each STB engaged in the upload process. The details of the acknowledgement process 
is still a work-in-progress. The acknowledgements will go out as addressable downstream Level 1 Pass-Through 
transactions. They will enter the STB through the UDP/IP protocol stack, and be handed to the Qickstream 
Processor as a PowerTV Event which the Processor has registered an interest in. 
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TCP/IP Protocol Header 


Adaptation Type 


Protocol 
Discriminator 


40 Octet 

Destination: CMC 
K«vfer VTE 


1 Octet 
0x02 = TCP 


1 Octet 

(Fixed for BLS Trial) 
0x80 








Adaptation Data 


Message Id 


Request Id 


Address Mode 


2 Octet 

Length (Octets to follow, 
Not including Trailer) 


1 Octet 
(Fixed) 
0x60 


2 Octet 
Not Used 


1 Octet 

0x01 = Addressable 


VTE Inserted > 


Level 2- Defined > 






Destination Address 


L2 DataCount 


Data Type 


VSP Id 


6 Octet 

• MAC Address 


2 Octet 

# Octets to Payload End 
Variable 


1 Octet 

0x00 = User Data 


I Octet 
0x01 = BIMS 


Level 2-Deflncd > ' 




Trausaction_Code 


Transaction_Payload 


2 Octet 

Defines TX.Payload 
O18OIF 


Data Acknowledgement Field (TBD) 

Application Level Data , , 









TCP/IP Protocol Trailer 



CRC Checksum, etc 



VTE inserted > ■■ 

Figure 20 : Acknowledgement Transaction Defined (As Sent by Staging Server) 
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3.5.4.3. 1.UDP/IP Protocol Header 



The UDP/IP protocol stack within the STB will be handling insertion of the UDP/IP headers and transport trailer for 
LI Pass-Through transactions transmitted upstream. The header is directly from the RFC 791 and RFC 768 
specifications. They are as follows: 



IP Header 


IP Version 1 Header Length I Type of Service 


Total Length 


Identification 


Flags 1 Fragment Offset 


Time to live 1 Protocol 


Header Checksum 


Source IP Address 


Destination IP Address 




UDP Header 


Source Port 


Destination Port 


Length 


Checksum 



Figure 21 : IP and UDP Header Definitions 



3.5.4.3.2.Data Compression on Upload 
The data compression algorithm LZW has been selected by PowerTV as a utility they will be providing t 
applications from the Operating System at some point in the future. They will be using LZW primanly 
decompression of large BLOBs , graphics files, data objects, etc. The Clickstream processor will also us 
algorithm to compress the Clickstream buffer before data is uploaded through the system. 

The LZW Compression algorithm will be treated in detail in the next release of this document. 



3.5.4.3.3.Level 1 Pass-Through Mechanism to IMS 
This is currenUy a work in progress. APIs have been defined but have not been delivered to BellSouth. 
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4. IMS Server Functions 

4.1. IMS Sourced Logs 
The IMS will be the source for content Metadata regarding all session based services, and in addition will act as a 
subset of the Clicks (ream information gathered on the session based services offered. 



4.2. Pass Through on CUckstream 

This interface does little more than act as a gateway for the reverse path LI Pass-Through Transactions to flow. It 
does not manipulate the contents of the LI Pass-Through Transaction. The IMS will wait on an open connection to 
the "Event Capture" process running on the Staging server, and will pass CUckstream data over that connection a« it 
is received. The interface to LI Pass-Through on the Staging server side is called by the following function call. 

Int IMS_P_Ll_Pass_Through ( Int Size, VarBlnary Payload); 
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5. Metadata Formats 

Content Metadata is information in an encoded form which describes the programming shown on the network and the 
time and broadcast or cable network on which the programming was shown. In the term "Content" we are including 
programming of virtually any type which is transmitted to the subscriber's Set-Top Box (STB). There are a number 
of sources for this information and it will be of utmost importance to manage this information appropriately. For 
this system, we will be interested in the merging of Content Metadata into our Clicks tream data primarily for the 
analysis of Broadcast services provided by the Cable Television Application (Cable App) and VOD/NVOD usage. 
We will also be interested in the numbers of STBs tuned to other interactive services when analyzing ratings and 
general viewing habits of our subscriber population. The merging of Content Metadata into our dickstream data 
adds the relevant information upon which most of our analyses will be based. 

The sources of Content Metadata for this system are shaping up to be: 

1) EPG System: General broadcast programming content. 

2) Broadcast Advertising: Advertising content occurrences on most national , local, and cable 
broadcast networks. This will not include some spot advertisements generated by local broadcasters, and will not 
include any spot advertisements generated within the BellSouth system. 

3) IMS System Logs : The IMS system will Log all stream occurrences, and the STB or STBs 
associated with the stream. This will primarily be in the form of VOD, NVOD, and interactive application 
execution (i.e. the Navigator). . 

4) Advertising Traffic Controller: This system is still in very formative stages at the tuneof- _ 
this writing. It will be a BellSouth system enabling the control of advertisement traffic through the BellSouth 
system throughout the interactive video trial. This system will hand off advertisement Metadata which is generated 
in-house by the insertion of advertisements in both broadcast and session-based services. 

There will also be system globally defined Metadata. There are lookup tables by which all sorts of other contextual 
information may be determined. These tables should be global to the BellSouth System were possible. Many, if 
not all of these tables are defined in this section. 

The dickstream system will attempt to resolve date formats coming from the various sources by defining a global 
data storage format in the MKIS system, and resolving any variations from that global format with the Metadata 
providers before the system comes on-line. If all variations cannot be resolved at their sources, an additional parsing 
mechanism will have to be implemented. 

In addition, there will be Content which is reported by several different sources. We will have to resolve these 
"Duplicate Occurrences'* for our Content data to be as accurate as possible. 



5.1. Globally Defined Metadata Formats 

Content Metadata is logged in a single main tabular form. The Metadata IDs are then resolved byseveral supporting 
data tables. In the MKIS system this data structure will take the form of a relational database model. 



5.1.1. Global Event ID Definitions 

A system global method for defining Event IDs is needed by every application to provide a context by ^chtfae 
event was important enough to be joumaled. This could be as simple as a broadcast channel change by pressing toe 
Chan Up key. It could be a passive change in the content shown on the network. AU of these event types are 
gathered into a single table for purposes of easing the data analysis process. No single application will make use ot 
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every possible event type. In most cases it will benefit the system overall for applications to make use of as little 



1 EVENT 


DEFINITIONS 


1 Content Related Events 


1 0x0000 1 Passive Chance in Content 


1 Direct Key Presses 


1 fYrfWH 


TV <r> iTV Pressed 


| UXUUUZ 


. Unvl 1 lVOOVU 


| UXUUUj 


\juc \i / nw»w 


1 fWIYYVl 
| UXvWArr 


Two 0\ Prr^Mi 


1 uxuuu d 




1 UXUIWO 


!aJUT \*tJ rivoovU 


I UXUUU/ 


nvc ^»jf r iw3w 


1 UXUWO 


oix \Oj rTcssca 


1 uxuwy 


OCVCO \ / J rTCJwCU 


1 UXUUU A 


ugni \0 j ricoocu 


| UXUUUd 


,nluc yyj rivootu 


| UXUUUl— 


tJCiXt \\ff rivoJKU 


1 uxuuuu 


* [piiiii« up ricojcu 


1 uxuuu ts 


vJVmilQ LAJWU r ivoovu 


1 UXUUUr 


v otumc up rrcoscu 


1 UxUUlU 


v oiumc LAjwii ricaocvi 


1 UXUUl 1 


L«3Sl v^UoiniCl i IvooCvl 


1 0x0012 


Favorite Channel Pressed 


10x0013 


Guide Kev Pressed 


1 0x0014 


Theme Kev Pressed 


10x0015 


Display Kev Pressed 


10x0016 


Mute Key Pressed 


1 0x0017 


A Key Pressed 


1 0x0018 


B Key Pressed 


1 0x0019 


C Kev Pressed 


IoxOOIA 


Info Key Pressed 


IoxOOIB 


Backup Kev Pressed 


jOxOOlC 


Main Menu Key Pressed 


JOxOOlD 


Up Arrow Kev Pressed 


lOxOOlE 


Down Arrow Key Pressed 


lOxOOlF 


Rieht Arrow Key Pressed 


10x0020 


Left Arrow Key Pressed 


10x0021 


Enter Kev Pressed 



0x0022 


Play Pressed 


0x0023 


Rewind Pressed 


0x0024 


Pause Pressed 


0x0025 


Record Pressed 


0x0026 


Fast Forward Pressed 


0x0027 


Stop Pressed 


ADolication/State Switching Related 


0x0028 


AC Power ON 


0x0029 


Application Switch (Normal) 


0x002A 


Application Switch (Abnormal) 


0x002B 


Application Terminated (Normal) 


0x002C 


Application Terminated (Abnormal) 


0x002D 


Soft Power OFF 


0x002E 


Soft Power ON 


0x002F 


OFF State Polling Event 


General 


0x0030 


Direct Channel Change 


0x0031 


Mute 


0x0032 


Un-Mute 


0x0033 


Volume Change Below 50% 


0x0034 


Volume Chance Below 25% 


0x0035 


Volume Change Below 10% 


0x0036 


Volume Change Above 50% 


0x0037 


Volume Change Above 25% 


0x0038 


Volume Change Above 10% 


0x0039 


Change to Interactive Mode 


0x003A 


Change to Broadcast Mode 




Figure 22: Global Event ID Definitions 
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5. 1.2. Timestamp Format 

The timestamp format for all Clickstream events throughout the system will be based on a 6 byte (48 bit) 
subset of the ANSI C and C++ standard TIMEJT data structure. It will identify a unique time since 
the year 1900 down to a one second resolution. Some of our content Metadata providers will only be able to give us 
data down to a one minute resolution, if this is the case, these Metadata records will have to be appended or modified 
defaulting the "second" field to 00. The following is an overview of the TIME_T data structure. 





<time.h> 






int 


tm sec 


seconds after the minute 


rt).6i) 


int 


tm min 


minutes after the hour 


(0,59) 


int 


tmjiour 


hours since midnight 


VP) 


int 


tm mdav 


dav of the month 


run 


int 


tm mon 


months since January 


(0,11) 


int 


tm_year 


years since 1900 


(0, 256) 



Figure 23: Global Time Format 



not used int tm_wday days since Sunday (0,6) 

not used int tm_yday days since January (0356) 

not used int Unjsdst Daylight Savings Time Rag 

The entire data structure as defined in the ANSI C spec is a full 9 Bytes. The last 3 bytes deal with defining the day 
of the week, the day of the year, and a daylight savings time flag. These could very easily be extrapolated after eveui 
generation in the MKIS system if needed, and it is not dear that they would be needed for any of the known 
analyses. The elimination of these three bytes from the Clickstream event decreases storage and bandwidth needs. 



5.1.3. Global Broadcast Channel Identifiers 

The following table defines a coding scheme for national and local broadcasters which will be carried on the 
BellSouth network during the Residential Broadband Trial. This list may grow as channels are added, but the 
identifiers will remain unique and constant for each channel. Any application joumaling off events which occur 
while subscribers arc viewing broadcast television will be able to identify the network carrying the programing 
content by using a subset of this table. In this way channel lineups can be changed over the life of the residential 
broadband trial, (and presumably a larger scale BellSouth services deployment), and the identifier for a netwo* 
would stay the same. Data repositories will be relieved of the task of mapping an ever-changing channel number to 
a network. 
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10x0100 to 



10x01 1 F 



News/Talk Shows 



10x0100 



CNN 



10x0101 



Headline News 



10x0102 



The Weather Channel 



10x0103 



CNBC 



10x0104 



CSPAN 



10x0105 



10x0106 



10x0107 



10x0108 



10x0109 



lOxOlOA 



10x0120 to 



I0X013F 



10x0120 



10x0121 



10x0122 



10x0123 



10x0124 



10x0125 



10x0126 



10x01 40 to 



CSPAN-2 



America's Talking 



Talk Channel 



Court TV 



The Crime Channel 



National Empowerment TV 



Sports 



ESPN 



ESPN-2 



SportSouth 



The Golf Channel 



Classic Sports Network 



Prime Network 



NewSport 



0X015F 


Music I 


0x0140 


MTV I 


0x0141 


VH-1 1 


0x0142 


Countiy Music Television 1 


0x0143 


The Nashville Network 1 


0x0144 


The Box 1 


0x0145 


Video Jukebox 1 


0x0146 


MOR Music TV 1 


0x0147 


Music Choice 1 


|ox0160 to 




I0X017F 


Shopping 1 


lox0160 


QVC 1 


|ox016l 


QVC-2 


10x0162 


Home Shopping Network 1 


|0x0163 


TV Macy's 


10x0164 


Cataloe 1 


10x0165 


S, The Shopping Network 


|0x0l66 


Cupid Network 


|0x0180 to 




lox019F 


Movies 



10x0180 



1 Turner Classic Movies 



iBfpadcast Channel ldantmcatloQ_Iflfalfl| [ 0x0181 



[American Movie Classics 



0x0182 



TNT 



10x0183 



Popcorn Channel 



10x01 AO to 



lOxOIBF 



iRellqious 



10x01 AO 



I Faith & Values Channel 



10x01 Al 



| The Inspirational Network 



|0x01A2 



| Trinity Broadcasting Netwotk 



|0x01A3 



I Eternal World TV Network 



|0x01A4 



(The Gospel Network 



l0x01C0 to 



I0X01DF 



[Health & Self- 
l lmprovement 



|0x01C0 



I Lifetime 



lOxOlCl 



[Cable Health Club 



|0xOlC2 



| The Hesflth Channel 



|0xOiC3 



I Parent Television 



10x01C4 j Recovery Network / The Wellness 

J Channel 

|0x01E0 to 



lOxOIFF 



ICultural/Ethnic 



IOxOIEO 



IA&E 



lOxOlEl 



BRAVO 



|0x01E2 



I EI 



|0x01E3 



I Ovation 



|0x01E4 



BET 



|0x01E5 



I BET International 



0x0 1E6 



BET on Jazz 



|0x0lE7 



I World African Network 



|0x01E8 



lUnivision 



|0x0lE9 



IGalavision 



lOxOlEA 



Pfelemundo 



lOxOlEB 



GEMS 



lOxOlEC 



1 International Channel 



10x0200 to 



10 x 0 2 1 F 



Educational 



10x0200 



I The Discovery Channel 



10x0201 



[The History Channel 



10x0202 



I The Learning Channel 



|0x0203 



I Mind Extension University 



10x0220 to 



I0X023F 



Kids 



|0x0220 



Nickelodean 



|0x0221 



I Cartoon Network 



10x0240 to 
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*(W99 



I0X02SF 



General 



10x0240 



10x0241 



The Family Channel 



10x0242 



USA Network 



0x0243 



tx 



0x0244 



WGN 



10x0245 



WWOR(NY) 



10x0246 



WPDC (NY) 



10x0247 



10x0248 



10x0260 to 



|0x027F 



10x0260 



10x0261 



10x0262 



10x0263 



10x0264 



10x0265 



10x0266 



10x0267 



10x0268 



10x0269 



|0x026A 



|0x026B 



|0x026C 



BS 



KTLA(LA) 



KTVT 



Specialty 



Sci-Fi Channel 



Comedy Central 



Sega Channel 



Nostalgia Television 



Americana Television Network 



The Collector's Channel 



TV Food Network 



The Travel Channel 



Jones Computer Network 



The Game Show Channel 



The Ecology Channel 



Home & Garden TV 



Kaleidoscope: America's Disability 
Channel 



0x026D 


The Military Channel 1 


0xO26E 


The Navy Channel 1 


0xO26F 


The Singles Channel I 


0x0270 


The Musician Channel I 


0x0271 


Romance Classics 1 


|0x0280 to 




0X02AF 


Premium Channels 1 


0x0280 


HBO 1 1 


10x0281 


HBO 2 1 


|0x0282 


HBO 3 1 


|ox0283 


Cinetnax 1 1 


lox0284 


Cinemax 2 1 


|0x0285 


The Movie Channel 


|0x0286 


Showtime 1 1 


|0x0287 


Showtime 2 1 


|0x0288 


Showtime Espanol 1 


|0x0289 


Showtime Family 


|0x028A 


Showtime Film 


|ox028B 


Showtime Comedy 



0x028C 


Showtime Action 


0x028D 


The Disney Channel 


0x028E 


Adam & Eve 


0x028F 


Spice 


0x0290 


Spice 2 


0x0291 


Theatre Vision 


0x0292 


Playboy 


0x0293 


Request TV 


0x0294 


Viewers Choice 


0x0296 


Your Choice TV 


0x0297 


Cable Video Store 


0x0300 to 




uxuoo r 


ncfll RroAHcflftt 
'rnvid Arfi 


VAvJvv 


WSB 


0*0101 


WAGA 


0x0302 


WGTV. 


0x0303 


WXIA 


0x0304 


WTBS 


0x0305 . 


WPBA 


0x0306 


WATL 


0x0307 


WGNX 


0x0308 


WVEU 






OxFFEO to 




OxFFFF 


Corner Cases / 
Error Conditions 


OxFFFEO 


Provider To Be Defined 


OxFM-El 


Provider Unknown 


OxFFFFF 


General Error 
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5.2. Content Metadata Formats 

Metadata regarding programming content provided in the BellSouth system is necessary to tie events which occur on 
me STB to what content the event may have been linked with. ,4 How many subscribers changed the channel during 
every Coca-Cola commercial 7\ "Do subscribers channel surf during some programs more than others7\ etc. All 
possible queries against our usage data will be enabled only by our ability to tie content to the services provided. 
The following is a general format for all content Metadata. A main table (taWe 1) ties a piece of programming 
content to a particular channel identifier between an accurate start and end time. The table 2 simply supports the ID 
scheme used in table 1 to minimize data storage and transmission capacities. Other Channel and Time encoding 
schemes are defined in section 5. i above. 





Content ID 


Channel ID 


Start Time 


End Time 


Exarrtf 


>les 


0x012345FF 


0x002F 


OxOOOOFEDCBAOO 


0x0000FEDCBF98 




0x01234567 


0x002F 


0x0000FEDCBA98 


OxOOOOFEDCBFFF 



Table 1 





Content ID 


Content Unique Descriptor 


Content Type 


Examples 


0x01234568 


4< MURPHY BROWN, #39" 


0x04 




0x01234567 


4< HERBIE RIDES AGAIN" 


OxOF 



Table 2 





Content Type 


Content Type Descriptor 


Examples 


0x04 


Situation Comedy" 


0x05 


'^Movie, Drama" 




0x06 


"A<frertisement" 


OxOF 


4< Movie, Comedy" 



Table 3 



Global Content Data Formats: 

Content ID : 32 Bit unique identifier for content 

Channel ID : 16 Bit unique identifier for programming network. 

Start Time: 48 Bit standard C / C++ *TIME_T* data format for description of time. 

End Time: 48 Bit standard C / C++ *TIME_T" data format for description of time. 

Content Unique Descriptor: Variable length character string describing programming content. 
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5.2.1. Electronic Program Guide Metadata Format 

This is an example of the types of data we will expect to be sourced by Prevue Guide during the Residential 
Broadband Trial period. Data formats can be expected to be more complex than that represented here. These data 
formats are expected to finalize over the next month. 

Prevue Guide Metadata is replicated forward from the Prevue Guide serv er to the Staging Server through a Client / 
Server replication RPC. 





Content ID 


Channel ID 


Start Time 


End Time 


Examples 


0x012345FF 


0x002F 


OxOOOOFEDCBAOO 


Ox0000FEDCBF98 




0x01234567 


0x002F 


0x0000FEDCBA98 


OxOOOOFEDCBFFF 



Table 1 





Content ID 


Content Unique Descriptor 


Content Type 


Examples 


0x01234568 


"MURPHY BROWN, #39" 


0x04 




0x01234567 


4< HERBIE RIDES AGAIN" 


OxOF 



Table 2 



Electronic Program Guide Content Data Formats: 

Content ID: 32 Bit unique identifier for content 

Channel ID : 16 Bit unique identifier for programming network. 

Start Time: 48 Bit standard C / C++ "TIMELT* data format for description of lime. 

End Time: 48 Bit standard C / C++ TIME JT data format for description of time. 



5.2.2. Advertisement Metadata Format: 
The format of advertisement data to feed into the MKIS would take the form of two tables. The first table would be 
the time/channel grid relating an advertisement identifier to a channel ED at a particular start and end time. The 
second would be a table relating advertisement identifiers to the actual advertisement content as described in a 
character string, the advertisement identifiers would describe an advertisement content for the life of the systemi c. 
IDs would not be recycled). The reporting agency would use our existing Channel ID metadata, and any other 
BellSouth standard encoding scheme for their content where possible. 
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Advertisement ID 


Channel ID 


Start Time 


End Time 


Exaraf 


>le 


0x01234567 


Ox002F 


0x0000FEDCBA98 


OxOOOOFEDCBAFF 



Table 1 





Advertisement ID 


Advertisement Unique Descriptor 


Example 


0x01234567 


"Coca Cola Corporation Ad # 1357" 



Table 2 



Advertising Data Formats: 

Advertisement ID : 32 Bit unique identifier for advertising content. 

Channel ID : 16 Bit unique identifier for programming network. 

Start Time: 32 Bit standard C and C++ 'TTMEJF data format for description of time. 

End Time: 32 Bit standard C and C++ 4 TEME_T' data format for description of time. 

Advertisement Unique Descriptor. Variable length character string describing advertisement 



5.2.3. IMS Log Metadata Formats 



5.2.4. BellSouth Advertising Traffic Control Metadata Formats 

The format of in-house advertisement data to feed into the MKIS would be identical to third-party sourced data. 



1 Advertisement ID 


Channel ID 


Start Time 


End Time 


Example 


0x01234567 


0x002F 


0x0000FEDCBA98 


OxOOOOFEDCBAFF 



Table 1 





Advertisement ID 


Advertisement Unique Descriptor 


Examr. 


>le 


0x01234567 


**Coca Cola Corporation Ad # 1357" 



Table 2 



Advertising Data Formats: 

Advertisement ID : 32 Bit unique identifier for advertising content. 

Channel id : 16 Bit unique identifier for programming network. 

Start Time: 32 Bit standard C and C++ *TIME_T* data format for description of time. 

End Time: 32 Bit standard C and C++ *TIME_T" data format for description of time. 

Advertisement Unique Descriptor Variable length character string describing advertisement 
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6. Staging Server Functions 

The Staging Server is a Sybase Conceptual Database entity which will serve many purposes in the trial. The main 
purpose in it's conception is a method to distribute many storage and communications processes off-line from the 
IMS which will be very busy with processor-intensive and time-critical actions controlling the VTE and handling 
signaling sessions. The actual hardware on which this system will reside may depend on performance we experience 
when the system comes on-iine. In-depth documentation on the many functions of the Staging Server will be 
handled by Sybase eventually when these designs come to a state of completion. This section will act as a rough 
overview of the Staging Server processes involved in the Clicks tream System. 



6.1. Upload Communications Procedure: Event Capture 



6.1.1. RPC Endpoint 

The Staging Server will act as the endpoint of the initial upload procedure of STB Oickstream information. When 
IMS receives a Level 1 Pass-Through packet with Oickstream data, the packet will be addressed to a particular 
process running on the Staging Server. This "Event Capture" prococess rung on the Staging Server will maintain 
an open connection with the Level I Pass-Through running on IMS. 

6.2. Persistent Storage 

The Event Capture process will provide for flat-file storage of da as is is delivered to the Staging Server over 

upstream communications. Data will then be decompressed and parsed from this flat file. 

6.3. Content Merge with Metadata 

Oickstream data must be brought together with contextual information on programming and advertising from both 
broadcast and interactive services. If this is done so in an effective manner, then analyses on the resultant data mil 
provide an enormous amount of value to BellSouth. The processes of merging the data could be done in a number of 
different ways. The method advocated in this design document reflects a consideration for avaUabdity and format ot 
data, system band widths, processing capabilities, storage requirements, and analysis capabilities. 
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Figure 24 : Staging Server Overview 



The merging methods break down into three parts. 

1) first a complete timeline is constructed for each Set-Top Box for an enure 24 hour Day period Tbsis 
done by inserting content identifiers into the Clickstream data, and by generating additional Chckstream data when 
content has changed and the subscriber continued to view the same programming channel or servee 

2) Filtering is done to "weed out" any extraneous data. This part of the merging process w,ll have to evolve 
over time as we learn more about the types of data which have proved to be valuable, and those which haven L 
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3) The timeline is further analyzed to provide a second timeline. This second data output is a list of content 
items "watched" by the STB and a list of content items "viewed" by the STB. These lists are generated based on 
decision criteria provided by BellSouth Marketing and Advertisement departments. 



These steps are repeated for each Set-Top Box in the system. 



Output 



Clicks tream 
Data 




^alue-Adferf* 
Clicks tream 
Timeline 



P revue Guide 
Data 



Broadcast 

Advertising 

Data 



Session-Services 
Advertising 
Data 



Session-Services 
Programming 
Data (SMS) 




Merge 
& 
Parse 
Engine 



"View" aid 

"Watch" 

Lists 



"View" and 
"Watch" 
Criteria 



"Value-Added 

Click^trcam 

Timeline 



Figure 25: Merging & Parsing Overview 



Merging and parsing should work on 24 hour segments of data at once to minimize the occurrence of un resolved 
events This is to say that aickstream Events are in and of themselves just pieces, and to do analyses on just 
several hours of data will leave the determination of programs being "watched" and "viewed" at the endpoints 
unresolved. 
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Clickstream Data 





Timestamp 


Application ID 


# Bytes to 
Follow 


Application S 
Event ID 


peel lie Data 
Channel ID 


Event Record 


3:59:30pm 


Cable App. 


4 bytes 


Soft Power On 


ABC 


Event Record 


4:00:00pm 


Cable App. 


4 bytes 


Channel Up 


NBC 


Event Record 


4:0£17pm 


Cable App. 


4 bytes 


Channel Up 


TBS 


Event Record 


4:06:25pm 


Cable App. 


4 bytes 


Channel Dwn 


NBC 


Event Record 


4:15:45pm 


Cable App. 


4 bytes 


Channel Up 


TBS 


Event Record 


4:55:45pm 


Cable App. 


4 bytes 


Soft Power Off 


None 



P revue Guide Data 



\ 

0 





Content ID 


Channel ID 


Start Time 


End Time 


Content Record 


National News 


ABC 


3:30:00pm 


4:00:00pm 


Content Record 


Murphy Brown 


NBC 


3:59:00pm 


4:30:00pm 


Content Record 


Simpsons 


NBC 


4:30:00pm 


4:59:00pm 


Content Record 


N.G.E\plorcr 


TBS 


3:05:00pm 


4:05:00pm 


Content Record 


Andv Grifith 


TBS 


4:05:00pm 


4:35:00pm 


Content Record 


NBA Basketball 


TBS 


4:35:00pm 


6:05:00pm 



Broadcast Advertising Data 



Content Record 



Content Record 



Content Record 



Content Record 



1 

5T Cont ent Record 



Con 



ombined Data 



Content ID 



Coca Cola #10 



Visa #2 



Ddta #1 



Ddta #3 



Visa #21 



Channel ID 



NBC 



NBC 



TBS 



TBS 



TBS 



Start Time 



4:00:30pm 



4:0l:00pm 



4:03:30pm 



4:04:00pm 



4:04:30pm 



End Time 



4:01:00pm 



4:01:30pm 



4:04:00pm 



4:04:30pm 



4:05:00pm 




-Parse and 
Merge 
Data 





Timestamp 


App. ID 


# Bytes to 
Follow 


Application Specific Data 
Event ID Channel ID 


Content ID 


Event Record 


3:59:30pm 


Cable App. 


6 bytes 


Soft Power On 


ABC 


National News 


Event Record 


4:00:00pm 


Cable App. 


6 bytes 


Channd Up 


NBC 


Murphy Brown 


Event Record 


4:00:30pm 


Cable App. 


6 bvtes 


Change Content 


NBC 


Coca Cola #10 


Event Record 


4:0l:00pm 


Cable App. 


6 bvtes 


Chance Content 


NBC 


VtsafT 


Event Record 


4:01:30pm 


Cable App. 


6 bvtes 


Change Content 


NBC 


Murphy Brown 


Event Record 


4:03:17pm 


Cable App. 


6 bvtes 


Channd Up 


TBS 


N.G.Expiorer 


Event Record 


4:03:30pm 


Cable App. 


6 bvtes 


Change Content 


TBS 


Ddta#l 


Event Record 


4:04:00pm 


Cable App. 


6 bvtes 


Change Content 


TBS 


Ddta #3 


Event Record 


4:04:30pm 


Cable App. 


6 bytes 


Change Content 


TBS 


Visa #21 


Event Record 




Cable App. 


6 bvtes 


Change Content 


TBS J 


■ N.G. Explorer 


Event Record 


4:05:00pm 


Cable App. 


6 bvtes 


Change Content 


TBS 


Andv Grifith 


Event Record 


4:06:25pm 


Cable App. 


6 bvtes 


Channd Dwn 


NBC 


Murphy Brown 


Event Record 


4: 15:45pm 


Cable App. 


6 bvtes 


Channd Up 


TBS 


Andv Grifith 


Event Record 


4:35:00pm 


Cable App. 


6 bytes 


Change Content 


TBS 


NBA Basketball 


"Event Record 


4:55:45pm 


Cable App. 


6 bytes 


Soft Power Off 


None 





2 

Pi 

4 * 

* '< 
/ 




ure 26: Example Content Merge 
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7- Clickstream Sizing Estimates 

Two sets of numbers were put together to start to get a handle on possible data rates and storage needs for the 
network, and the back end systems to support the Clickstream system. 

7.L Maximum Sizing Estimates 

This is a maximum possible approach to understand the absolute upper limit on clickstream bit rates, and storage 
capacities during the trial period 

For the first analysis every STB in the system is being used 24hours a day, 7 days a week, and someone is changing 
broadcast channels once a second. This is an extreme scenario, but one which should present us with absolute 
maximum data and storage rates for the Clickstream system. 

Assumptions: • Maximum possible click rate after STB filtering = 

1 click per second uniformly distributed over 24hr x 7days 

• 4000 subsribers for the trial period. 

• Uniform upload process through appropriate control mechanisms. 

• Average number of bytes per click = 14 uncompressed. 

• Compression rate of 50% can be acheived for uploads. 

• 70 Broadcast channels in system 
Conclusions: • Max number bits/second uploaded through system: 

224,000 bits/second for system 
7,000 bits/second per RT 

28,000 bits/second per Stottcd-Aloha Modulator 

( 18% of gross bandwidth, about 36 % of net bandwidth) 

• Max number bytes of raw data to be stored per day: 

•Clickstream: 4,838 Gigabytes per day 

• Content Metadata: 2 Megabytes per day 

7.2. Reasonable Sizing Estimates 
For the second analysis, every STB in the system is being used 24 hours a day, 7 days a week, and *™<^<^ 
changing the broadcast channel once a minute. This is a more reasonable scenario m that during peak channel 
Surfing ' data rates are likely to be higher on a per STB basis. This is balanced with the fact that neither duration of 
viewing or overall system viewship rates are expected to every be this high. 

Assumptions: • Reasonable click rate after STB filtering = 

1 click per minute uniformly distributed over 24hr x 7days . 

• 4000 subsribers for the trial period. 

• Uniform upload process through appropriate control mechanisms. 

• Average number of bytes per click = 14 uncompressed. 

• Compression rate of 50% can be acheived for uploads. 

• 70 Broadcast channels in system 
Conclusions: • Max number bits/second uploaded through system: 

3,733 bits/second for system 
117 bits/second per RT 

467 bits/second per SIottcd-AIoha Modulator 

(.03% of gross bandwidth, about .06% of net bandwidth) 

• Max number bytes of raw data to be stored per day: 

• Clickstream: 80 Megabytes per day 

• Content Metadata: 2 Megabytes per day 
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8. MKIS System 

8.1. Persistent Storage 

8.2. Hardware Interfaces 
8*2.1. Staging Server 

8.2.2. Cable Data 

8.2.3. IMS (indirect?) 

8.2.4. I3/OpUmark Upload 

8.2.5. 3rd Party Analysis Upload 

8.2.6. Marketing Report Generation Ethernet Interface 

8. 3. Content Merge Engine - 

8.3.1. Prevue Metadata 

8.3.2. Advertisement Metadata 

8.3.3. Advertising Traffic Controller Metadata 

8.3.4. IMS Generated Metadata 
8.4. Analysis Capabilities 

8.4.1. Rough Overview 
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9. Defining Terms 

The following are some terms used in this document and alternatives you may have encountered elsewhere: 

13 Formerly CONDO : Consortium of New Database Operators. A group of companies which will be providing an 
Analysis capability for Clickstream information, and will act as an interface between the advertising community and 
BellSouth. 

Clickstream System: A demograptocs/programming ratings collection system which will function on the 
BellSouth IVSN. This document is an initial attempt to define the scope of this functionality, and the means we 
may use to make it happen. 

STB- Set-Top Box. The "Cable Converter" that sits on 'Top" fin this case possibly on bottom) of the 

subscribers television set. The device provides a platform on which content is converted to the NTSC video format 
and presented to the subscriber, and input is taken from the subscriber and transmitted to the cable operator. 
Analogous Terms: Set-Top Terminal (STT), Cable Converter, Home Communications Terminal (HCT), 
"Richmond"(Sdenufic-Atlanta model name for this device). 8600X, 8600XDi. 

Subscriber The end user of the system, the person who subscribes to the service Traditionally in consumer 
electronics the "consumer" is the end user of the products or systems produced- In Cable Television the •'consumer 
is typically a cable operator or large Multiple System Operator (MSO). so the term subscriber was adopted. 

Upstream: Generic term used for a data path provided by any mechanism FROM the subscriber to the headend, 
server, or billing and management databases. If the system is viewed as billing and f*^"'^^"^ 
top video server and headend below. Level 1 system below that, and finally the subscriber and the STT at the 
bottom, then data flows "Upstream" or "Downstream". Analogous Terms: Reverse data path. 

TDMA, slotted ALOHA: The two mechanisms for upstream communication from the STT to the server. 

Downstream: Generic term used for a data path provided by any mechanism FROM billing or mamgement 
computer to server or headend, and FROM server/headend to the subscriber/STB. Analogous Terms: Forward data 

path. 

Event: An action or change in a STT state deemed important to the budding of a taowledge base on the 

subscriber. Examples include key presses to change channels, enter Navigator, turn STT off/on, etc. Also includes 
passive changes such a change of programming content while STT tunes single channel such commercial break, 
changes to different commercials, changes to next program, etc. 

Clickstream: Term used to describe a flow of STT event data upstream to the HP server complex. The 
Clickstream then flows further upstream to the 13 database where demographic and programming ratings intelligence 

can be gathered and formatted 

ATOM: A content identifier coding scheme. ATOM codes will be used to determine programming ratings 

and subscriber viewing habits. 

PowerTV: The operating system used on this model of STB. It must provide for most, if not all of 

applications functionality through the use of APIs. 
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